首页> 外文OA文献 >Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System using Deep Recurrent Neural Networks
【2h】

Speech Enhancement for a Noise-Robust Text-to-Speech Synthesis System using Deep Recurrent Neural Networks

机译:使用深度递归神经网络的噪声鲁棒文本到语音合成系统的语音增强

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Quality of text-to-speech voices built from noisy recordings is diminished. In order to improve it we propose the use of a recurrent neural network to enhance acoustic parameters prior to training. We trained a deep recurrent neural network using a parallel database of noisy and clean acoustics parameters as input and output of the network. The database consisted of multiple speakers and diverse noise conditions. We investigated using text-derived features as an additional input of the network. We processed a noisy database of two other speakers using this network and used its output to train an HMM acoustic text-to-synthesis model for each voice. Listening experiment results showed that the voice built with enhanced parameters was ranked significantly higher than the ones trained with noisy speech and speech that has been enhanced using a conventional enhancement system. The text-derived features improved results only for the female voice, where it was ranked as highly as a voice trained with clean speech.
机译:从嘈杂的录音中建立的文本到语音的质量下降。为了改善它,我们建议在训练之前使用递归神经网络来增强声学参数。我们使用包含噪声和干净声学参数的并行数据库作为网络的输入和输出来训练深度递归神经网络。该数据库由多个发言人和各种噪声条件组成。我们调查了使用文本衍生功能作为网络的附加输入的情况。我们使用此网络处理了其他两个扬声器的嘈杂数据库,并使用其输出来训练每种语音的HMM声学文本合成模型。聆听实验结果表明,使用增强参数构建的语音的等级明显高于使用嘈杂语音和使用常规增强系统增强的语音训练的语音。源自文本的功能仅针对女性语音改善了结果,在女性语音中,它的发音与经过纯净语音训练的语音一样高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号